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Abstract 

The Hilbert transform is essentially the only singular operator in one dimen- 
sion. This undoubtedly makes it one of the the most important linear operators 
in harmonic analysis. The Hilbert transform has had a profound bearing on sev- 
eral theoretical and physical problems across a wide range of disciplines; this 
includes problems in Fourier convergence, complex analysis, potential theory, 
modulation theory, wavelet theory, aerofoil design, dispersion relations and 
high-energy physics, to name a few. 

In this monograph, we revisit some of the established results concerning the 
global behavior of the Hilbert transform, namely that it is is weakly bounded 
on L 1 (R), and strongly bounded on L P (R) for 1 < p < oo, and provide a self- 
contained derivation of the same using real-variable techniques. This note is 
partly based on the expositions on the Hilbert transform in [ 1 , 3 ] . 



1 Introduction 

The Hilbert transform of a sufficiently well-behaved function /(x) is defined to be 



1 

$ef(x) = -lim 

71 £->0 



r 



dt 

fix-t)— . (1) 

i£|>E t 



It is not immediately not clear that ^4f/(x) is well-defined even for nice functions. 
Though {TJ "almost" looks like an ordinary convolution, there are however certain 
technical subtleties associated with the definition. The primitive idea behind the 
definition of the transform is quite simple, namely to transform /(x) by convolving 
with the kernel 1/nx. It is in doing so rigorously that one encounters technical 
difficulties - the kernel fails to be absolutely integrable owing to its slow decay 
and, more importantly, due to the singularity at the origin. The limiting argument 
in (Q]) is used to avoid the singularity by truncating the kernel around the origin 
in a systematic fashion. As will be shown shortly, this indeed works for sufficiently 
regular functions. The other pathology, namely the slow decay of the kernel, can 
be be circumvented relatively easily simply by restricting the domain of (Q} to 
functions having a sufficiently fast decay. 
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The non-trivial task in the study of the Hilbert transform is, in fact, the specification 
of the class of functions on which the sequence of integrals in (O can be given a 
precise meaning, either pointwise or in the norm sense. More precisely, one needs 
to show the integral 

1 f dt 
jej{x)=-\ /(x-t)— 

% h\>e 1 

is (absolutely) convergent for all e > and that (i) either #€J{x) converges for 
(almost) all x as e — » 0, which provides a pointwise specification of 3tff ; or (ii) 
that the sequence of functions 3tf E f converge in the norm to some function in V as 
e — » 0, which is then defined to be the Hilbert transform of / . 

Here we will focus only on the latter global characterization of M 1 , namely the fact 
that it is weakly bounded on L 1 (marginally fails to be bounded), and that it is 
strongly bounded on V = L P (R) for 1 < p < oo; the latter result was originally 
derived by M. Riesz using techniques from complex analysis [2j. We will however 
use real-variable techniques (see [3]]) and will particularly focus on the main ideas 
rather than the technical details. 



2 Details of the derivation 



To begin with, we restrict the domain of to the Schwartz class S = S(R) on 
which 3tff e {x) is well-defined for all x and for every e > 0. Indeed, following the 
fact that the p-th power of l/|t| is integrable outside the interval (— e,e) for all 
1 < p < oo, Holder's inequality tells us that #€ E f(x) exists for all e > 0. As far 
as the convergence is concerned, all we need to show is that the integral remains 
absolutely convergent even as e — > 0, or equivalently that 



lim sup 

E^O 



I/O - t)/t\dt < oo. 



t\>e 



To this end, we split the integral in (©, and use the anti-symmetric nature of 1/t 
to write 



.#70) = - lim 

1 

= — lim 



e<|t|<l 



dt 1 
/(x-t)— + - 
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dt 
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£<|t|<l 

Now, using the mean value theorem, we can write 

/(t)-/(0) 



/0-t)- 



dt 



\t\>i 



lim 

E^O 



-dt 



f'(x-9t)dt 



|t|<i 



E<|t|<l 

where 9 is a number between x and x — t. Since the derivative is bounded, we 
conclude that 



|t|>£ 



\f(x-t)/t\dt <2||/'|| 0O +||/|| 1 , 
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so that J?f{x) is indeed well-defined for all x. 

As a by product of the above observations, we note that the the Hilbert transform 
of a function can be defined pointwise provided the function exhibits sufficient 
regularity and decay; in particular, we can modify the above derivation to show 
that the Hilbert transform of compactly supported function with some Lipschitz 
regularity is always well-defined. 

Having established the validity of (Q]) for the Schwartz class, we next proceed with 
the derivation of the following global estimates for this class: 

(i) #e is weak (1, 1): 

m({x: WOOl > A}) < jH/ll!. (2) 

(ii) J*f is strong (2, 2): 

IWIl2 = ll/ll2- (3) 

That is, we will show that that 3*? takes L 1 into the so-called weak L 1 (a space larger 
than L 1 ), and that it maps S into L 2 in an isometric manner. Using the fact that 
S is dense in L p , we can then easily extend these estimates from the sense subclass 
S to the larger L p spaces (this is similar to the approximation technique used for 
extending the domain of the Fourier transform from either S or L 1 to L 2 ) . 

In particular, we will derive the weak (1, 1) inequality using the decomposition of 
Calderon and Zygmundand, and the strong (2,2) inequality using the theory of 
distributions and the properties of the Fourier transform on L 2 . The strong (p,p) 
boundedness for 1 < p < 2, will then be leveraged from these results using a 
powerful interpolation result. Finally, we will extend the strong (p,p) result for 
2 < p < oo using duality and the fact that J*f is skew-adjoint. 

Remark: We would like to emphasize on the fact that the foregoing account only 
tells us that is bounded on L p for 1 < p < oo (more precisely, bounded on a 
dense subclass and thus has a unique extension); it however does not settle the 
local problem, namely whether (0 makes sense for U functions in the notion of 
pointwise convergence, though it does offers confidence that this indeed must be 
the case (one needs to study the maximal version of the operator in this case, where 
the limit in (Q]) is replaced by the supremum of the absolute value). 

We will first derive the estimate in ^ and then the estimate in (O. We will con- 
tinue to stress on the main ideas rather than the technical details. 

2.1 Strong (2, 2) nature 

Before diving into the details, we would like to note that the bounded nature of #e 
on L 2 can be deduced (at least informally) using a argument based on the scaling 
property of the Fourier transform. Note that since 1/t is homogenous of degree — 1, 
its Fourier transform (1/t) (provided it indeed is a true function) must necessarily 
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be of degree 0, that is, it must be bounded. Thus, if we treat (0 as a convolution 
between /(t) and the kernel 1/t, then the convolution-multiplication rule along 
with the Parseval-Plancherel theorem gives us the estimate 

IWII2 = WWi = 11(1/0/112 = c||/|| 2 = c||/|| 2 

which establishes the fact that 3tf is bounded on L 2 (in fact, C = 1 as will be 
determined shortly). We will now rigorously show that is unitary on L 2 using 
the machinery of distributions (this can also be done using classical techniques 
based on complex analysis [2]]). 

Consider the distribution W on S, specified by 



<W,/) = -lim 

71 e^O 



\t\>e 



dt 



It is easily seen that W is linear. Moreover, one can show that the map / —* (W,f) 
is continuous from S to R so that W represents a valid distribution. Given a 
Schwartz function f(x~), note that (0 can be written as 

3tff(x)={W,T x f) 

where T x /(t) denotes the function (T x /)(t) = /(x — t). 

If we denote the Fourier transform of W by W and that of / by/, then we can use 
duality to write 



(W,f) = (W,f) 



= — lim 



= — lim 

n e->0 



r l/e 

\X\>E 

fl/c 

I 

|X|>£ 



/w- 



dx 



(chopping off the tail) 



\ dx 

f(co)e- }OJX dco)—. 

J x 



Interchanging the integrals, using Fubini, and applying dominated convergence, 
we have 



(W,/) = -lim 

71 e^O 



2J 

K 



/(")( 

J |jc|>£ 

r r 1 / £ 
/(w)flim 

J \X\>£ 



dx ) dco 

x J 

sm cox \ 

dx) 

x J 



dco 



2j [ n 
= /(") -sign(w)dw 

71 I Z 



(-jsign( co))f(co)dco. 
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This tells us that the Fourier transform of W is in fact a function, and is given by 



W(o>) = — jsign(co). 

This also means that the Fourier transform of (J*f/)(x) can then be expressed 
(using the convolution-multiplication rule) as 

Sff (w) = -jsign(&>)/(&>). 
Then using the Parseval-Plancherel identity, we have that 



\3Vf{oc)\ 2 dco. 



\f(co)\ 2 dco-- 



\fW\ 2 dx 



for all / in S. This, in particular, establishes the fact that and hence 3€f ' , is in 
L 2 for all / e S. In other words, takes S into L 2 , and that it is unitary: 



|^f/(x)[ 2 dx: 



|/(x)[ 2 dx (/ e S). 



This establishes the estimate in ([3]). 

We can now extend the domain of ^ from the dense subclass S to L 2 using a 
continuity argument. For example, given an arbitrary function / in L 2 , we consider 
an approximating sequence (/„) e S such that ||/ n — /|| 2 can be made arbitrarily 
small for sufficiently large n. Then, using the strong (2,2) one can easily verify 
that the sequence (•#?/„) is Cauchy in L 2 . We define Jtff to be the limit of this 
Cauchy sequence (this is known to exist and is unique) . This new operator, which 
we continue to denote by X , is also bounded: 



IWII 2 =ll/ll2 (for all /eL 2 ). 



2.2 Weak (1, 1) nature 

As it turns out, the Hilbert transform is not bounded on L 1 and we would have to 
use a completely different set of tools to describe its behavior on this space. Before 
going through the details, we will first highlight the main difficulties involved in 
the derivation of the estimate and the strategies we use to handle them. 

1. Control of measures using norms. Let us first comment on the reason why the 
bound in (EJ is termed as "weak". Note that using the Chebyschev inequality, we 
can write 

|{x:|/(x)|>A}|<^ 

provided that / is integrable (we use |A| to denote the Lebesgue measure of some 
measurable subset A of R). In particular, if T is a bounded operator on L 1 , so that 
IIT/Hj SCII/Hj for all/ in L 1 , we have 

|{x:|r/(x)|>A}|<^^<C^. 
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This tells us that T is also weakly bounded on L 1 . The reverse assertion however 
is not true in general. For the Hilbert transform, one can easily see that ffl is 
not bounded on L 1 by considering the indicator function X[o,i]M an d its Hilbert 
transform ^f^[o,i]( x )- An explicit computation shows that ^°X[o,i]C x ) decays only 
as 0(l/|x|) for large x (besides having "blow-ups" at and 1) and hence is clearly 
not integrable. 

2. Use of the weak bound. Since we have already shown M" to be bounded on 
L 2 , one can however hope to salvage the situation at least for the intermediate 
V spaces (1 < p < 2) by using an interpolation argument. This is exactly where 
the Marcinkiewicz interpolation theorem comes to the rescue, which roughly states 
that if T is a weakly bounded linear operator on V and L q , then T is strongly 
bounded on L r for all p < r < q. In particular, we will show that is weakly 
bounded on L 1 , whereby the fact that that 3*? is bounded on L p for 1 < p < 2 will 
be immediately established. 

3. Application on bounded integrable functions. In order to derive the weak 
bound, it is clear from the above discussion (particularly one on the Chebyschev 
inequality) that one would be required to bound integrals of the form 

- 

\jVf(x)\ a dx (n = l,2) 

by the L 1 norm of / (of course, assuming that #€f{x) exists almost everywhere). 
It is however not clear whether this can be done for all Schwartz (or integrable) 
functions. This can be achieved under two distinctive situations. 

The first among these is the case where the function g(x) is both integrable and 
bounded; one can then verify that g e V for all 1 < p < oo (this itself is a kind of 
interpolation result). Indeed, using the fact that J*f is strongly bounded on L 2 , we 
have for n = 2, 

j \^g{x)\ 2 dx = j \g(x)\ 2 dx < \\g\U\g\h. (4) 

4. Application on localized oscillating functions. The second case is the more 
interesting one which fundamentally relies on the odd nature of the kernel 1/x. 
This is the case when the function under consideration is well-localized and 
has a zero integral (oscillating). Indeed, if the support of b(x) is restricted to an 
interval / and if J b = 0, then we can entirely avoid the limiting argument in (Q]) 
to write 

z>(y) ) d y 

j Vx — y x — cJ 

provided that x lies outside J, where c denotes the centre of the interval. If we 
denote by 21 the interval having the same centre but twice the length as J, then we 



3tfb(x) = - 
n 



dy = - 



- y 



71 
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see that 



|^b(x)|dx < 



R\2I 



( \ \Ky)\ — —\dy)d> 

-fl b Mlff i lj n C ' i dx K 
Jt , v mo, x-y x-c ■/ 



< 



|6Cy)lf f ^^cZxldy 



since |y — c| < |7|/2 for all ye/, and |x — y| > |x — c|/2 for all x outside 21. The 
inner integral is computed to be 



R\27 



(x - cf 



rdx = 2, 



which provides us the the reasonable estimate 



\#eb(x)\dx < CUblli. 



(5) 



R\2J 



We have however avoided a certain neighborhood of the support of b(x) while 
evaluating the integral of |^fb(x)|. As will be seen shortly, this does not pose 
much of a problem since we can always control the size of this excluded interval 
by the norm of the function. 

We are now in a position to derive ([2]). We will do this only for non-negative 
functions; this will suffice since we can decompose any arbitrary function into its 
positive and negative parts, apply the result to each of them, and recombine the 
estimates. 

Following the above arguments, the main strategy would be to decompose the 
function /(x) into a bounded and integrable part g(x), and a series of localized 
oscillating bumps denoted by b(x). The following version of a classic result of 
Calderon and Zygmund tells us that every integrable function (Schwartz functions 
in particular) can indeed be resolved in this manner (cf. Appendix A for details) : 



The Caldero-Zygmund decomposition. Let / be an non-negative integrable func- 
tion on R and A be a positive number. Then there exists a sequence of almost 
disjoint intervals {I k } such that 

(i) /(x) < A for almost every x outside Q = [J k I k , 

(ii) The size of n is controlled by/, |fi| < A -1 !!/^, and 

(iii) For every I k , A < |7 k | _1 j I /(x)dx < 2A. 

Let us set 

J/00 forxefi c , 
8M= \\I k \- 1 ! I fix)dx forxe/,. 
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It is clear that g(x) < 2A and = \\f\\ v We then set b(x) =/(x) - g(x). This 
can be written as b(x) = ^ k b k (x), where each b k (x) is defined to be 



ifM-li^jjMdx \fx<=i k , 

i otherwise. 



It is clear that b k is supported on the interval I k where J b k (x)dx = 0, and that 
l|6|li< 2||/Ilx. 

Since 3tff(x) = Wg(x) + 3Vb(x), 

\{x : \JVf{x)\ > A}| < \{x : |^g(x)| > A/2}| + |{x : \j*b{x)\ > A/2}|. 
Using ([4]) and the fact that \\g\\i = \\f we can use Chebyschev to get 

|{x:|^g(x)|>A/2}|<^||/|[ 1 . (6) 

To estimate |{x : |^fb(x)| > A/2}|, we consider the union = [J k 2I k of size 
|n*| < 2|f2|. Then using estimate © and the fact thalQ \3tfb\ < Y, k almost 
everywhere, we have 



\{x : \3tfb{x)\ > A/2}| < |f2*| + \{x e R\n* : \3tfb{x)\ > A/2}| 
2 
A 



< dl/Hi+^KxeR^ : |^ fc (x)|>A/2}| 



<|ll/ll 1 + | 



r 

\3Vb k {x)\dx 

R\2/ t 



<^ll/lli- (7) 
Combining © and (0, we get 

\{x: Wf{x)\>k}\<j\\f\\ x . 

This establishes the desired weak (1, 1) bound for the Hilbert transform. Based 
on an approximation argument, similar to the one used earlier for extending the 
domain of tf? from S to L 2 and using the notion of convergence in measure instead 
of norm, we can define the Hilbert transform J*?/ of a function / in L 1 , which 
satisfies the estimate 

|{x:|^f/(x)|>A}|<^||/|| 1 . 



1 this is obvious if the sum is finite; there is some mild technicality involved in doing the same for an 
infinite sum. 
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2.3 Strong (p,p) nature using interpolation 



Since 3tf is linear and we have shown that it is weak (p,p) for p = 1 and 2, we can 
conclude from the Marcenkiewicz interpolation theorem that must be strong 
ip,p) for all 1 < p < 2 (cf. Appendix B for details). This result can be extended to 
2 < p < oo using duality between the conjugate spaces V and L q where 1/p+l/q = 
1. The particular result we need is that 



11/ II, 



sup 



f(x)g(x)dx 



In particular, for / e L q , 2 < q < oo, we can use this duality along with the fact 
that Jtf is skew adjoint (this can be established for the Schwartz class and then 
extended using the usual density argument), namely that 



to write 



\m\\ q 



sup 

kll P =i 



Wf{x)g{x)dx 



sup 

kll P =i 



3tfg(x)f(x)dx 

< sup ||^g|| p ||/|| ? 



(HZlder inequality) 
(J*? is bounded on L p ) 



<c||/|| r 

This establishes that Jtf is strong (_q,q) for 2 < q < oo. Note that the trick used in 
the above argument can be applied to any operator that is self-adjoint up to a sign, 
that is, to establish the boundedness of the operator on V, 1 < p < oo, it suffices 
to do so only on the V , 1 < p < 2, or on V , 2 < p < oo. 



Appendix A 

To keep this note as self-contained as possible, we formulate the particular ver- 
sion of the Calderon-Zygmund decomposition used in establishing the weak (1, 1) 
bound, namely that if f(x) is a non-negative integrable function and that A > 0, 
then there exists a sequence of almost disjoint (at most countable) intervals {/;.} 
such that the following hold: 

(i) f(x) < X for almost every x not belonging to \J k I k ; 

(ii) The total length of the intervals {/;.} is controlled by the norm of /, 

IU 7 *l^ii/iii- 
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(iii) The average of / on every I k is uniformly bounded, 



A< 



14 



/ < 2A. 



(9) 



This is the so-called Calderon-Zygmund decomposition of / at height A and can 
be achieved using the following dyadic decomspoition strategy. We begin by parti- 
tioning R into a mesh of intervals, whose interiors are disjoint and whose common 
length is so large that |J| _1 J / < A for every J in this mesh (this can clearly be 

achieved since |J| _1 j f approached zero as |J| gets large). 

Let I be a fixed interval in this mesh. We split I Q into two equal intervals. If we 
denote one of these intervals by l x , then we have two distinct possibilities, namely 
that either 

1 r 

f > A. 



or 



1 



f<\. 



In the former case, we do not split 7 : any further and J x is selected to be one of the 
intervals I k appearing in the decomposition. We have for it ([9]), because 



A< 



f< 



2" 1 |/n 



/ <2A. 



In the latter case, we split J x and repeat the process until we are forced into the 
former case (if this happens at all). We repeat this process starting with every 
interval from the initial mesh. Clearly, the resulting intervals I k are countable, and 
are almost disjoint by construction. To derive ([8]), we note that 



k k k J 



1 

/ = A 



U, 



f<j\\f\\, 



Finally, the fact that /(x) < A for almost every x outside [J k Ik can be deduced 
from the Lebesgue differentiation theorem, which states that the relation 



/(*) = 



lim - 

|/Ho 



f(x-y)dy 



(10) 



holds for almost every x if / is integrable (this is the Lebesgue counterpart of 
the fundamental theorem of calculus). Indeed, for every x belonging to the com- 
plement [J k I k and for sufficiently small / (this might not hold for certain larger 
intervals containing x; however this is inconsequential as we only need to consider 
the limiting case involving sufficiently small intervals), we have by construction 



1 



f{x-y)dy<X. 
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Taking limits as |/| — » and by applying (flOT) we see that /(x) < A for almost every 
x outside ljfc4- This establishes all the properties of the decomposition. 



Appendix B 



The weak (p,p) estimates for Jff for p = 1,2 can be leveraged to a strong (p,p) 
estimate for 1 < p < 2 using interpolation. The first step in doing so is by breaking 
up a function / e L p , for some 1 < p < 2, as / = g + h, where g and h are in L 1 and 
L 2 respectively. One way of achieving this decomposition is by truncating \f | on its 
range, that is, by setting g = f X{\f\>x\ and h = fX{\f\<x] ror some ^ > (typically 
h is the tail part of / and g is the singular part of / ) . Since 1 — p < and 2 — p > 0, 
we see that 



| /m i-p< A i- 



l/l>A 



\f\ p < A 1-p | 



\f\>X 



and 



|/|P|/| 2 "P < A 2 " p 



l/l<A 



l/l p <A 2 -P||/ll P , 



l/l<A 



so that g and h are indeed in the appropriate spaces. 

To proceed further, we need the concept of a distribution function. Let us denote 
the number |{x : |/(x)| > a}\, corresponding to some a > and some function /, 
by Df(a). The reason for introducing this function is that D f {X) contains sufficient 
information for the evaluation of the L p norm of / . Indeed, by setting E a = {x : 
|/(x)| > a}, we can write 



a p - l D f {a)da = p\ a p-1 ( XE a M dx ) da 
Jo Jr 

= [ ( [ pa''- 1 XE a Wd a )dx 

Jr Jo 
r r \f(x)\ 
= ( pa p - l da}dx 

Jr Jo 



which gives us the equivalence 



11/ ll| 



\fW\ p dx, 



a p_1 D f (a)da. 



Now, using the weak (p,p) estimates (p = 1,2), we have 



D* g W<j\\ g \\ 1 = l 



I/I- 



(11) 



|/|>A 
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Similarly, 



D xh {X)<—\\h\\ 2 2 < — 



l/l 2 . 



(12) 



l/l<A 



The rest of the computation is based on the properties of Df (a) and the interplay 
of the indices of the V spaces. In particular, since Jjf/(x) = g{x) + #€h{x), one 
can verify that D X ^{X) < D M . g [X/2) + D xh {X/2). By combining this with (fTTTl and 
(fl2l) . we have 



\m\\ p D =P 



a?- 1 D jef ta)da 



<p\ a?- 1 D Jtg (a/2)da + p a^D^a^d 



<B 1 



a p_2 ( |/| )da + B 2 a p - 3 ( l/| 2 )da 

J|/|>A Jo J\f\<& 

r\f\ C C°° 

( aP~ 2 da)\f\+B 2 ( aP~ 3 da)\f\ 2 



JR J\f\ 



= c\\f\\>. 



This shows that J>iC is bounded on V for 1 < p < 2. 



Remark: The above was a special case of the so-called Marcenkiewicz interpo- 
lation theorem. The above derivation can easily be extended to the most gen- 
eral form of the theorem which states that if T is sub-linear (i.e., \T(J + g)\ < 
\Tf\ + \Tg\~) and is both weak (p,p) and weak (q,q), then T is strong (r, r) for all 
l<p<r<q<oo. The fact that the weaker hypothesis of sub-linearity would 
suffice is also clear from the above derivation since the sub-additivity property 
D T (f +g -)(a) < D T f{_a) + D Tg (a) holds even when T is sub-linear. 
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